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INTRODUCTION 


Projects  In  Computer  Aided  Climatology  is  a  report  on  the  work  per¬ 
formed  by  Regis  College  Research  Center  under  Air  Force  contract  AF-19628- 
78-C-0153  during  the  period  between  July  1,  1978  and  June  30,  1981.  It 
is  not  inclusive  of  all  efforts  made  by  the  Center  under  this  contract. 
Several  projects  are  reported  on  in  considerable  detail.  They  are  ex¬ 
amples  of  the  type  of  activities  that  were  perceived  as  being  the  most 
productive  research  activities  that  could  be  performed  in  fulfillment  of 
the  stated  objectives  of  the  contract. 

These  activities  are:  1)  the  construction  of  climatological  com¬ 
puter  data  bases  facilitating  the  formulation  and  testing  of  mathemati¬ 
cal/statistical  hypothesis  and  models  and  2)  the  development  of  computer 
aided  technology  for  summarizing  and  communicating  results  derived  either 
directly  f '■om  empirical  data  or  from  theoretical  activities  of  analysis 
and  interpretation  of  scientific  inquiry,  not  a  substitute  for  them.  They 
bring  rhe  vast  power  of  the  electronic  information  processing  revolution 
creatively  to  the  aid  of  the  atmospheric  scientist  and  mathematician. 

The  report  is  divided  into  two  sections.  Section  one  gives  the 
details  of  the  construction  of  two  climatological  data  bases.  The  pro¬ 
cess  and  criteria  for  culling  raw  data,  the  ordering  and  merging  of  data, 
and  the  format  of  the  final  fora  of  the  bases  is  described.  The  infor¬ 
mational  contents  of  both  bases  are  summarized.  Several  applications  of 
one  of  the  data  bates  and  a  comparison  to  other  related  collections  of 
data  are  presented. 


Section  two  is  a  description  of  a  computer  geopolitical  mapping 
system  intended  for  presenting  climatological  and  meterologic.al  informa¬ 
tion.  The  system  is  interactive  and  makes  use  of  the  Lambert  azimuthal 
equal  area  projection  which  was  suggested  to  Regis  by  Irving  Gringorten 
of  AFGL.  This  project  in  computergraphics ,  although  more  difficult  than 
initially  anticipated,  was  surprisingly  satisfying  for  the  author  and 
the  student  assistants  to  work  on. 

This  report  is  to  act  as  a  guide  to  subsequent  workers  on  similar 
projects.  This  is  especially  true  of  Section  II.  The  totality  of  work 
outlined  has  been  tested  for  feasibility  but  not  all  of  it  has  been  added 
to  the  existing  system  as  a  final  piece  of  software.  An  attempt  is  made 
to  indicate  what  is  needed  in  the  future  and  how  to  do  it. 

The  author  would  like  to  express  his  gratitude  to  those  members  of 
the  Regis  community  who  assisted  in  this  effort,  especially  to  the  stu¬ 
dent  employees  of  the  Research  Center,  to  Marion  Polzella,  J.D.,  the 
present  director  of  the  Research  Center  and  to  Sister  Mary  Anne  Doyle, 
Ph.D.,  her  predecessor.  Very  special  recognition  must  be  paid  to  Sister 
Leonards  Burke  who  inspired  all  of  us  at  the  Research  Center  through  her 
life  long  dedication  to  diligent  mathematical  and  scientific  research. 

Also  Sister  Ann  Marie  Grady,  C.S.J.  is  responsible  for  preparing 
the  photograph  of  Figure  II-A-1  on  page  52.  Miss  May  Ngan  of  the  Research 
Center  should  be  recommended  for  cannonization  for  her  efforts  and  pa¬ 
tience  on  typewriting  the  first  draft  of  this  manuscript . 
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SECTION  I 


the  constkuctios  of  too  climatological 


COMPILER  DATA  BASES 
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FIGURE  I-A-l 
A  Whole  Sky  Photograph 

The  above  is  a  representative  whole  sky  photograph  which  is  included 
in  the  following  study.  The  National  Weather  Service  judged  the  sky  to 
be  70%  cloudy  in  a  concurrent  observation.  The  Regis  regional  photograph 
interpretation  is  shown  in  Figure  I-A-4.  The  sun  disk  and  the  anemometer 
stand  are  seen  in  the  photograph.  This  photograph  was  taken  at  1500  LST 
on  June  10,  1966. 

These  photographs  were  originally  taken  to  aid  in  estimating  the 
probability  of  cloud-free-lines-of-sight  which  were  required  for  deter¬ 
mining  the  utility  of  optical  and  infrared  search  and  tracking  sytems. 
Earlier  attempts  to  determine  such  probabilities  including  sunshine  meters 
and  satellite  photographs  had  been  found  to  be  inaccurate  [1], 
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SUBSECTION  I  A 


THE  COLUMBIA,  MISSOURI  PHOTOGRAPHIC  SKY  COVER  DATA  BASE 

1.  The  Original  Data 

The  original  data  consisted  of  slightly  less  than  3000  whole  sky 
photographs  collected  at  Columbia,  Missouri  and  a  concurrent  collection 
of  National  Weather  Service  (NWS)  observations.  The  photographs  were 
taken  at  0900,  1200,  and  1500  LST  between  March  1966  and  September  1969. 
There  were  several  durations  of  time  during  this  period  when  photographs 
were  not  taken  because  of  equipment  malfunction  (c.f.,  Subsection  I-A-3 
of  this  report  for  the  effect  that  this  had  upon  the  seasonal  composition 
of  the  data) . 

Reference  [1]  gives  a  complete  description  of  the  photographic  tech¬ 
niques  and  equipment  employed,  it  also  gives  an  elaborate  description  of 
the  larger  collection  of  photographs  taken  at  Columbia.  For  completeness, 
a  few  details  will  be  given  here.  The  photos  were  taken  using  a  Nikon 
camera,  a  specially  ground  fish-eye  lens,  a  lens  cover,  and  a  sun  shield 
which  followed  the  disk  of  the  sun  (used  to  prevent  over  exposure  of  the 
film).  The  film/filter  combination  was  most  sensitive  to  infrared  which 
produces  greater  contrast  between  cloud  and  blue  sky  than  would  be  ob¬ 
served  by  the  naked  eye.  This  tends  to  slightly  increase  the  amount  of 
cloudiness  observed.  The  paper  of  Lund,  Grantham  and  Davis  [2]  presents 
a  comprehensive  description  of  the  data  selected  for  this  data  base. 

Figure  I-A-l  is  a  representative  sample  of  a  whole  sky  photograph  analyzed 
under  this  Regis  contract. 
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The  NWS  data  were  sent  to  Air  Force  Cambridge  Research  Laboratories 
on  80-column  Hollerith  cards.  The  format  of  these  cards  is  given  in 
Figure  I-A-2.  Out  of  over  three  thousand  cards,  only  three  were  found 
to  be  defective  by  the  criteria  of  improper  characters  in  fields.  A  very 
small  number  of  the  reporting  times  were  missing  from  the  data  sent. 

The  selected  photographs  were  interpreted  and  also  transcribed  to 
Hollerith  cards  in  1977  by  the  Regis  College  Research  Center,  using  the 
following  techniques. 

The  photographs,  which  are  contained  in  reels  of  35-mm  transparen¬ 
cies,  were  projected  by  means  of  a  film  transport  and  projected  onto  the 
screen  which  was  stationed  in  a  darkroom.  The  image  was  adjusted  and 
focussed  so  that  the  outer  rim  of  the  photograph  aligned  with  the  outer 
circle  on  the  screen.  The  orientation  of  the  photograph  could  be  checked 
by  observing  the  position  of  an  aneometer  stand  and  water  tower  (see  the 
example  of  a  whole  sky  photograph  given  in  Figure  I-A-l) . 

The  circular  region  of  each  photograph,  corresponding  to  zZzvcuLwn 
angles  greater  than  five  degrees,  was  subdivided  into  one  hundred  and 
eighty  five  subregions  of  approximately  equal  area.  The  outer  rim,  cor¬ 
responding  to  elevation  angles  of  less  than  five  degrees,  was  ignored 
because  of  haziness  and  other  problems  complicating  interpretation  [2]. 

A  drawing  of  the  entire  circular  region  of  the  photograph,  the  concentric 
inner  circular  region  of  zZzvcUJ.on  angles  greater  than  live  degrees,  and 
the  boundaries  of  185  subdivisions  were  drawn  on  the  face  of  a  projection 
screen.  In  addition,  seventeen  small  circles  (each  of  area  less  than  1% 
of  the  area  of  any  of  the  subregions)  were  also  drawn  on  the  screen. 


Figure  I-A-3  portrays  the  configuration  on  the  face  of  the  projection 
screen . 

For  each  photograph,  the  cloudiness  of  each  of  the  185  subdivisions, 
was  ranked  in  eighths  and  an  integer  between  0  and  8  was  recorded  in  a 
corresponding  column  on  the  coding  sheets.  Also  each  of  the  seventeen 
small  circles  were  ranked  either  "0"  or  "1".  A  "1"  was  assigned  if  and 
only  if  the  center  of  the  circle  was  contained  in  cloud  of  any  sort  or  size. 
This  information  was  also  recorded  on  the  coding  sheets  and  later  punched 
on  80  column  Hollerith  cards  at  the  Research  Center.  This  information 
required  three  cards  for  each  photograph  and  thus  the  complete  data  for 
each  observation  time  were  contained  on  four  cards.  The  first  card  in 
order  was  the  NWS  data  card  and  the  next  three  cards  were  photo  data  cards 
prepared  by  Regis.  The  complete  format  of  all  cards  is  shown  in  Figure 
I-A-2.  Observations  were  included  in  the  study  initially,  if  and  only  if, 
both  the  photograph  and  the  NWS  card  existed. 
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a) NWS  DATA  CARD  FORMAT  b)  PHOTO  DATA  CARD  1  c)  PHOTO  DATA  CARD  2  d)  PHOTO  DATA  CARD  3 
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FIGURE  I-A-2 


The  above  drawing  is  a  reduction  of  the  geometric  figure  drawn  on  the 
projection  screen.  The  actual  fiadLLuA  of  the  outer  circle  on  the  screen 
was  29  cm.  compared  to  7.7  cm.  in  the  above  drawing.  The  seventeen  points 
are  located  in  the  center  of  the  small  circles.  For  the  sake  of  legibility 
neither  the  circles  nor  the  alphanumeric  characters  are  drawn  to  scale. 

The  upper  right  hand  cut  away  is  the  actual  size  of  the  innermost  circle 
(Zenith  angle  5°) .  The  small  circle  for  point  A  is  also  drawn  to  scale 
in  the  cu-  away. 
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2 .  The  Culling  of  Questionable  Observations  from  the  Data  Base 

After  the  cards  were  ordered  and  merged  by  computer,  the  entire  file 
was  made  into  an  UPDATE  "program  library".  UPDATE  is  a  CDC  utility  whose 
function  is  to  facilitate  insertion,  deletion  and  modification  of  card 
images  to  a  computer  file  which  is  called  a  "program  library”  (PL).  In 
addition,  UPDATE  also  automatically  keeps  a  chronological  history  of  all 
changes  made  to  the  PL.  This  record  is  kept  as  part  of  the  PL  itself  and 
is  passed  from  one  generation  to  the  next.  UPDATE  permits  the  user  the 
option  of  purging  any  portion  of  the  PL  history.  Using  the  appropriate 
directives,  a  user  may  restore  unpurged  inactive  data  to  active  status  in 
the  program  library.  Figure  I-A-4  is  a  copy  of  part  of  one  page  of  the 
current  UPDATE  listing  of  the  data  base. 

The  photo  data  punched  at  Regis  was  carefully  checked  by  the  CDC 
6600  for  1)  improper  characters  in  each  data  field,  2)  improper  values 
in  any  field,  and  3)  impossible  times  or  dates.  A  listing  of  all  suspected 
data  was  compiled  and  checked  against  the  coding  sheets  for  correction. 

New  cards  were  punched  and  automatically  inserted  using  UPDATE.  The  re¬ 
sulting  data  set  consisted  of  2931  observations  or  11,724  card  images 
which  were  saved  on  nine  track  magnetic  tape,  system  disk  permanent  files 
and  a  private  disk  pack. 

For  each  observation  Q  the  sum  of  the  number  of  eighths  of  cloudiness 
in  all  of  the  one  hundred  and  eighty-five  subregions  was  computed.  This 
quantity  will  be  denoted  as  E(Q)  and  0  <_  E(Q)  <_  1480.  The  number 
E(Q)/148.0  will  be  called  o  (Q)  or  the  raw  sigma  rank  of  observation  of 
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Q  and  the  rank  of  observation  Q  in  tenths  will  be  denoted  as  o(Q)  * 

*  1  * 

[a  (Q)  +  .5].  The  raw  rank  a  is  a  continuous  variable  between  0  and  10 
while  o  is  one  of  the  eleven  integers  0  to  10. 

A  comparison  of  the  o  ranks  for  each  observation  with  the  v  rank  (the 

number  of  tenths  of  cloudiness  reported  by  NWS)  was  made  by  computer  and 

* 

a  list  of  those  observations  for  which  the  raw  a  (Q)  score  differed  from 
v(Q)  by  more  than  two  was  prepared.  The  coding  sheets  were  checked  for 
each  of  these  observations.  If  any  error  was  found,  new  cards  were  punched 
and  inserted.  If  the  cards  were  an  accurated  copy  of  the  coding  sheet  data 
and  the  discrepancy  was  serious,  the  photographs  themselves  were  reexamined. 
In  some  cases,  the  photographs  were  found  to  be  defective,  i.e.,  over, 
under  or  multiply  exposed  or,  in  a  few  cases,  the  lens  cover  was  apparently 
dirty  or  obscured  by  condensation.  In  such  cases,  the  data  was  removed 
from  the  study. 

In  a  few  other  cases,  the  photographs  seemed  to  have  no  relation  to 
the  NWS  observation.  These  observations  were  also  eliminated  from  the 
data  base  under  the  assumption  that  they  were  out  of  chronological  order. 
Other  photographs  checked  at  the  same  time  seemed  to  correspond  well  with 
NWS  information  but  the  interpretation  of  photographs  contained  on  the 
coding  sheets  seemed  questionable.  It  was  felt  thcC  simply  reinterpreting 
those  that  did  not  agree  with  NWS  observations  and  leaving  those  that  did, 
would  introduce  an  unacceptable  chance  of  bias  in  the  resulting  statistics. 
If  any  photograph  in  a  NWS  category  (those  observations  with  the  same  value 


^Bracket  Notation:  fx]  designates  largest  integer  less  than  x  if  x  >  0. 
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of  v)  was  to  be  reinterpreted,  then  the  entire  NWS  category  was  to  be 
reinterpreted.  This  led  to  the  reinterpreting  and  repunching  of  all 
data  from  v  *  4  tc  v  *  9.  This  encompassed  nearly  25%  of  all  observations. 

The  reinterpreted  data  contained  no  "large”  (>2.0)  differences  be¬ 
tween  photo  observations  and  NWS  observations  and  the  agreement  seemed 
superior  to  the  first  interpretations.  In  a  few  cases,  (the  photo  shown 
in  Figure  I-A-l  for  instance)  repeated  reinterpretation  did  not  improve 
the  argument  between  photograph  and  NWS  data,  however,  the  argument  was 
marginally  within  the  tolerance  of  jo(Q)  -  v(Q)J  <_  1  accepted  in  this 
study.  Both  methods  of  estimating  total  sky  cover,  of  course,  involve 
human  judgments.  When  the  sky  is  broken  into  many  small  disconnected 
regions  of  cloudiness, such  as  is  the  situation  in  Figure  1-A-l ,  judgment 
by  either  method  seems  to  be  strained  and  what  is  seen  on  the  photograph 

is  influenced  by  the  film  and  the  camera  lens.  The  final  RMS  difference, 

* 

for  the  entire  sample,  between  the  raw  o  and  v  scores  for  each  observation 
was  .7.  Of  the  original  observations,  2808  remained.  The  point  data  was 
not  used  to  determine  if  an  observation  was  to  be  kept. 


3.  The  Final  Forms  of  the  Columbia  Data  Base 

The  data  base  now  has  two  forms:  one  is  the  UPDATE  program  library 
which  has  previously  been  described  and  the  second  is  a  shortened  form 
which  is  more  convenient  and  is  less  expensive  to  work  with  for  statis¬ 
tical  analysis  but  does  not  contain  any  NWS  information  other  than  the 
value  of  v  for  each  observation. 

The  program  library  form  of  the  data  base  was  edited  in  late  1978. 
The  new  library  generated  consisted  of  all  old  card  images  with  the  re¬ 
interpreted  ones  replacing  their  predecessors.  The  chronological  record 
of  changes  contained  in  the  library  started  fresh  at  this  time.  The 
chronological  record  of  previous  changes  were  destroyed.  The  immediate 
predecessor  of  each  reinterpreted  observation  was  saved  with  inactive 
status  in  the  new  PL.  Figure  l-A-4  is  a  sample  of  a  small  portion  of 
the  PI.  listing.  Active  cards  are  marked  with  an  "A"  on  the  right  hand 
side;  inactive  with  an  "I".  The  old  data  can  be  restored  individually 
or  en-mass  by  the  use  of  the  proper  directives  to  UPDATE.  This  UPDATE 
program  library  is  saved  on  private  disk  pack  STDATA  as  permanent  file 
CLOUD.  It  is  also  saved  on  9-track  magnetic  tape  SKY001.  It  is  impor¬ 
tant  for  any  user  to  remember  that  UPDATE  files  cannot  be  copied  by  any 
other  CDC  utility  and  must  be  processed  by  UPDATE  itself. 

The  second  form  of  the  data  base  requires  only  14  records  per 
observation  as  opposed  to  32  for  the  longer  version.  Each  record 
consists  of  data  from  one  hundred  observations  (or  1400  words) . 

The  file  is  in  "long  stranger"  type  records  and  is  suitable  for 
FORTRAN  BUFFER  IN  statements  which  were  chosen  because  of  economical 
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factors.  This  form  of  the  data  base  is  intended  for  operational  use  ar.d 
is  regenerated  from  the  UPDATE  program  library  before  periods  of  appli¬ 
cations.  It  requires  less  than  45  seconds  to  generate  from  the  program 
library  and  it  has  not  seemed  worth-while  to  save  it  between  periods  of 
use. 

It  was  mentioned  in  Subsection  I-A-l  that  the  equipment  malfunctioned 
during  certain  periods.  Also,  nearly  two-hundred  observations  were  elim¬ 
inated  as  described  in  Subsection  I-A-2.  Thus,  it  was  not  expected  that 
the  seasonal  distribution  of  observations  would  be  uniform.  Of  the  final 
2808  observations,  24.5 %  were  taken  in  the  spring,  30.1%  in  the  summer, 
21.5%  in  the  autumn  and  23.9%  in  the  winter  (see  Table  I-A-6) . 
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.  A  Summary  of  the  Content  of  the  Columbia  Data  Base 


References  [1],  [2],  and  [3]  ir.  the  bibliography  are  concerned  with 
the  Columbia  whole  sky  photographs.  Paper  [2]  uses  the  Regis  data  base 
for  all  of  its  studies,  however,  only  2805  of  the  2808  observations  were 
utilized  since  three  photographs  of  the  2808  still  remaining  in  the  data 
base  have  been  determined  to  be  of  questionable  quality.  Paper  [3]  pro¬ 
vides  an  example  of  the  application  of  information  contained  in  the  data 
base  pertaining  to  the  spatial  distribution  of  cloudiness. 

Table  I-A-6  is  a  summary  of  the  monthly  distribution  of  cloudiness 
Table  I-A-7  and  I-A-8  reveal  the  remarkable  characteristics  of  the  fre¬ 
quency  density  of  cloudiness:  the  extreme  categories  zero  and  ten  (0-5% 
and  95-100%  cloudy)  comprises  over  one  half  of  the  total  number  of  obser¬ 
vations  and  only  ten  percent  of  the  range  of  cloudiness.  This  is  true 
for  both  regional  photographic  interpretation  (RPI)  and  the  NWS  observa¬ 
tions.  The  extremalness  is  slightly  more  pronounced  for  the  RPI  than  NWS 
observations. 

Table  I-A-9  emphasizes  the  extremal  nature  of  the  data.  The  extremal 
cases  here  are  represented  by  ranks  0  and  1480.  The  first  of  these  is  the 
category  of  photographs  for  which  less  than  one  eighth  of  cloudiness  is 
reported  for  each  subregion.  The  second  is  the  category  for  which  every 
subregion  is  reported  to  be  totally  overcast.  These  two  categories,  to¬ 
gether  comprising  only  one  fourteen  hundred  and  eightieth^  of  the  total 

^The  two  extreme  ranks  "0"  and  "1480"  each  are  "half  ranks"  (i.e.,  each 
represent  on  1/2  the  length  of  the  other  1479  equal  length  intervals.) 

See  Figure  I-A-6. 
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range  of  cloudiness  (each  subregion  is  presumed  to  be  ranked  0  only  if 
one  sixteenth  or  less  is  cloudy),  contain  forty-two  percent  of  the  total 
number  of  observations. 


Table  I-A-10  presents  the  frequency  density  functions  for  the  total 
period  of  observations  taken  at  several  different  locations  in  the  United 
States.  The  data  was  extracted  from  a  few  of  the  ETAC-USAF  observation 
stations  mentioned  above.  Relative  frequency  densities  from  the  Regis 
data  base  (RPI,  NWS  and  point  photographic  interpretation  (PPI)  to  be  dis¬ 
cussed  later)  are  presented  in  the  same  table  for  ease  of  comparison.  In 
addition,  relative  frequencies  from  the  Columbia,  Missouri  ETAC-USAF  data 
specific  to  the  hours  0900-1400  LST  are  included.  These  numbers  are  av¬ 
eraged  to  account  for  the  monthly  variations  of  quantity  of  observation  in 
both  samples.^ 

Examination  of  ETAC-USAF  cloudiness  data  reveals  that  at  nearly  every 
station  in  the  United  States  for  which  data  was  available,  the  extremal 
categories  (rank  0  and  rank  10)  account  for  forty  or  more  percent  of  the 
total  observations  tallied  over  the  entire  period  of  observation  for  that 
station.  The  occurrence  of  any  hour,  at  any  month,  from  any  given  station 
in  which  an  internal  category  contains  more  observations  than  the  sum  of 
the  two  extremal  categories,  is  rare.  Exceptions  are  most  frequent  during 
summer  months,  at  midday,  at  locations  with  warm  moist  climates.  A  study, 
Reference  [4],  of  size,  altitude,  number  and  cover  distributions  of  cumuli 

*Since  the  ETAC-USAF  data  is  presented  in  two  hour  groupings  (0900-1100, 
1200-1400  and  1500-1600  LST)  either  the  last  grouping  had  to  be  ignored 
or  weighed  in  with  an  arbitrary  coefficient.  The  former  option  was  taken. 
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near  Pensacola,  Tlorida,  using  daytice  Angnar  data,  supports  a  conjecrur 
that  extrenalness  Is  .i»i»l  v»an  the  cover  is  predoninately  of  snaU 
scale  size  (presumably  cumulus)  clouds. 


&  .na-vr^K. 


RELATIVE  FREQUENCY 


FRACTION  CLOUDY 

I 

FIGURE  I -A- 5 

The  Ranking  System  in  Tenths 

The  above  histogram  of  relative  frequency  versus  fraction  of  sky  cloudy  | 

for  regional  photographic  interpretation  of  the  Columbia  data  is  presented  i 

to  illustrate  the  ranking  system  in  tenths  used  in  this  report. 

Rank  k  =  0  is  the  collection  of  all  data  with  less  than  five  percent  i 

of  the  sky  cloudy. 

j 

Rank  k  *  10  is  the  collection  of  data  with  more  than  ninety-five  J 

percent  of  sky  cloudy. 

For  other  values  of  k,  i.e.,  k  «  3,  rank  consists  of  all  data  with  ! 

twenty-five  percent  of  the  sky  cloudy  and  less  than  or  equal  to  thirty-five  : 

percent  cloudy.  i 

The  ranks  k  *  0  and  k  «  10  comprise,  in  this  case,  one  tenth  of  the  i 

range  of  cloudiness  but  contain  sixty-one  and  one  half  percent  of  all  data.  j  j 

The  same  idea  is  used  for  the  eighths  and  1/1480  ranking  systems.  The  j  | 

combined  range  of  the  two  extremal  ranks  equal  the  range  of  ordinary  rank.  k  j 
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TABLE  I-A-8 

Enumeration  of  Observations 

by  Month  and  Tenths  of  Cloudiness  Taken  from  NWS  Observations 
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TABLE  I-A-9 


Selected  Details 

of  the  Photographic  Interpretation  Density 


and  Distribution 


The  data  in  the  table  is  listed  for  some  of  the  1481  ranks  of  the 
photographic  data.  The  top  integer  N  for  rank  [r  *  ROW  plus  COLUMN)] 
is  the  number  of  photographs  with  r  fourteen  hundred  and  eightieths  of 
cloudiness  for  the  entire  photograph.  The  middle  number  is  the  value 
of  the  empirical  probability  density  function:  ■  (Nr/Ng) (Ax/Axr) 

where  N  is  the  sample  size  (28C8) ,  Ax  «  1/1480  and  Ax  *  Ax/2  for 
s  r 

r  =  0  and  r  »  1  and  Ax  otherwise.  The  bottom  number  is  the  empirical 

probability  dUfubuticn  function  ?r: 


? 

r 
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5.  A  Historical  Note  on  the  Frequency  Densities  of  Cloud  Cover 


The  property  referred  to  as  extremalness  in  §  I-A-4  was  noted  by 
Karl  Pearson  in  1898  [5].  Pearson  had  theoretically  anticipated  the  ex¬ 
istence  of  natural  phenomenon  with  what  he  referred  to  as  a  "U-shaped 
distribution",  in  an  earlier  memoir  on  skew  variance.  At  the  time  of 

|  this  prognostication,  Pearson  had  no  satisfactory  example  of  such  data. 

The  occasion  of  the  publication  [5]  was  to  report  that  a  Hugo  Meyer  of 

i 

I  Berlin  had  published  a  study  of  the  "degrees"  of  cloudiness  measured  at 

I 

|  Breslau.  The  study  consisted  of  3653  daily  observations  made  during  the 

decade  1876-1886  of  cloudiness  ranked  in  tenths.  This  data  is  reproduced 
as  percentage  relative  frequencies  in  Table  l-A-10  of  this  report.  In 
[5], Pearson  examines  the  moments  and  other  parameters  of  this  first  ex¬ 
ample  of  "U-shaped  distributions". 

Pearson  remarks,  "...  the  method  used  for  determining  the  extent 
of  cloudiness  is  not  entirely  satisfactory,  and,  as  Herr  Meyer  remarks, 
the  observer  must  have  had  some  personal  bias  with  regard  to  the  grade 

9  ..."  on  Page  287  [5].  This  is  a  comment  on  the  reduced  probability 
between  rank  8  and  rank  10  (5.0  to  3.0  to  57.1)  seen  in  the  Breslau  data 
in  Table  I-A-10.  The  c-finion  of  Pearson  and  Meyer,  considering  the  amount 
of  data  that  they  had  access  to,  was  certainly  not  unreasonable.  Today, 
when  hundreds  of  thousands  of  observations  have  been  gathered  from  scores 
of  locations,  and  it  is  known  that  densities  often  show  similar  depres¬ 
sions  at  rank  1  and  also  very  frequently  at  rank  9,  it  is  still  be- 

•  lieved  by  many  climatologists  that  observational  bias  accounts  for  the 

l 

;  local  minima  adjacent  to  the  extreme  ranks. 
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Considering  that:  a)  ETAC-USAF  observations  show  that  these  blips 
occur  at  most  locations  when  tallied  over  the  complete  period  of  the 
study,  but  seldom  happen  at  every  hour  and  at  every  month  for  any  given 
location,  and  b)  that  there  appears  to  be  predictability  to  the  times  of 
maximum  and  minimum  depressions;  a  fundamental  relationship  to  other 
climatological  factors  should  not  be  dismissed.  As  with  extremalness  of 
the  density,  these  depressions  on  casual  examination  seem  to  relate  to 
scale  size  of  the  clouds  (cumulus  in  day,  cirrus  during  night  hours). 

Studies  utilizing  existing  data,  and  selected  experiments  using  mod¬ 
ern  photometric  technique  could  be  used  to  choose  between  the  hypotheses 
of  observational  bias  and  of  functional  relationship  to  other  climatol¬ 
ogical  factors.  Computer  interpretation  of  the  photographic  images  would 
minimize  the  human  bias  factor  and  would,  in  this  way,  be  superior  to  the 
Regis  method  of  interpretation,  which  still  contains  the  possibility  of 
human  bias  in  the  interpretation  of  the  subregion  ranks  in  eighths.  The 
current  study  is  equivocal  on  this  point.  The  adjusted  ETAC-USAF  data 
shows  no  local  minima  at  rank  1  or  9  and  neither  does  the  RPI  data.  The 
concurrent  NWS  data  shows  a  slight  dip  at  rank  1  (18.4  to  4.5  to  4.7). 
Several  slight  depressions  appear  in  the  monthly  RPI  data  (Table  I-A-8) . 

Accurate,  complete  and  insightful  knowledge  of  the  frequency  density 
of  cloudiness  is  indispensible  to  a  meaningful  study  of  the  temporal  vari¬ 
ation  of  percent  cloudiness.  The  density  is  an  indirect  statistical  meas¬ 
ure  of  the  persistence  of  a  given  amount  of  cloudiness  and  thus  is  re¬ 
lated  to  tne  time  rate  of  change  of  the  amount  of  cloudiness;  i.e.,  the 
higher  the  density  of  cloudiness,  the  smaller  the  mean  absolute  time  rate 
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of  change  of  cloudiness.  Feller  devotes  Chapter  III  of  his  monumental 
text,  [6],  to  the  "arc  sine  law",  which  is  a  model  of  a  special  case  of 
an  extremal  density  related  to  the  one  step  random  walk  problem  associ¬ 
ated  with  the  experimental  flipping  of  a  "fair"  coin. 

Specifically,  the  "arc  sine  law"  defines  the  density  of  "first 
passage  times"  for  this  problem.  The  name  "arc  sine  law"  derives  from 
approximating  the  integral  of  f(s)  «  1/ (•"V/s-s* )  over  the  interval  [0,x] 
by  the  expression  (2  arc  sin  )  /"  .  More  general,  one  step  random  walk 
probl  ;  are  related  to  other  extremal  functions  in  subsequent  chapters 
of  [6],  No  simple  relationship  has  yet  been  determined  between  cloudi¬ 
ness  and  one  step  random  walk  processes.  Eventually,  this  may  provide 
a  clue  to  modeling  the  stochastic  relationship  between  cloudiness  and 
temporal-spatial  parameters. 
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TABLE  I-A-10  (Continued) 


6.  The  Seventeen  Points 


A  simple  indicator  of  cloudiness  was  devised  using  the  seventeen 
binary  values  Vq(Q)  (the  value  corresponding  to  points  a  e  {a,b,c,...q} 
for  observation  Q) .  The  formula  for  the  indicator  is: 

P(Q)  -  10  [^(l>a(Q)/17)  +  .  5j 

where  the  bracket  notation  is  used  to  indicate  truncation  to  the  largest 
integer  less  than  the  quantity  enclosed  in  brackets  (c.f.,  ->(Q)  in  §  Sub¬ 
section  I-A-2) . 

The  range  of  values  of  p(Q)  is  the  set  of  eleven  integers  {0,1, ...10} 
as  is  the  range  of  v(Q)  and  o(Q)  .  The  root  mean  square  of  the  difference 
(o(Q)-  c (Q) )  and  (v(Q)-p(Q))  were  computed.  Their  values  were  found  to  be 
respectively  .950  and  1.15  compared  with  the  root  mean  square  difference 
of  (o(Q)-v  (Q))  which  was  found  to  be  .7  (c.f.,  §  Subsection  I-A-2). 

The  significance  of  these  RMS  differences  cannot  be  judged  by  the 
usual  means  (i.e.,  the  formula  on  pages  236-237  of  Cramer,  .7]  ,  since 
the  sample  cannot  be  assumed  to  be  normally  distributed.  A  simple  ad  hoc 
model,  not  presented  here,  shows  that  for  a  sample  with  the  extremal  dis¬ 
tribution  of  v  that,  if  the  were  randomly  chosen  for  each  observation, 
the  expected  value  of  the  RMS  difference  (v(Q)-p(Q))  would  be  4.5.  The 
values  obtained,  .95  and  1.14  would,  according  to  this  model,  be  equiva¬ 
lent  to  the  expected  value  of  the  RMS  of  an  experiment  in  which  the 
were  chosen  randomly  and,  if  j  v— p j  >  3  using  the  formula  for  p,  the  value 
of  p  would  be  readjusted  so  that  J v-pj  *  0. 
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Examination  of  the  actual  data  reveals  that  the  differences 
jp(Q)-v(Q)|  are  occasionally  quite  large  (e.g.,  five,  and  in  one  case 
ten)  and  thus  on  the  average  they  must  be  less  than  one.  The  penalty 
extracted  for  one  value  of  the  difference  being  equal  to  ten  is  that  one 
hundred  must  be  equal  to  rero  for  the  average  to  balance  to  be  one  over 
that  set  of  one  hundred  and  one  observations.  This  good  showing  ri  the 
RMS  differences  is  partially  accounted  for  by  the  fact  that  the  differ¬ 
ences  are  usually  zero  for  extremal  observations  which  are  in  the  majority. 

Tables  I-A-ll  and  l-A-12  show  the  agreement  of  the  point  data  with 
the  subregional  data.  Several  of  the  employees  involved  in  the  photo¬ 
graphic  interpretations  were  questioned  about  point  evaluation  by  the 
author.  Two  of  them  had  the  impression  that  points  WCAC  the  little  cir¬ 
cles  and  would  assign  a  '  if  any  part  of  the  small  region  intersected 
a  cloud.  The  third  employee  understood  the  instructions, but  complained 
that  it  was  difficult  to  determine  where  the  center  of  the  circle  was 
and  further  felt  that  if  she  had  reinterpreted  some  of  the  photographs 
several  times,  that  she  would  have  had  several  different  results  because 
the  processes  were  so  sensitive  to  the  alignment  of  the  photograph  on 
the  projection  screen.  The  ratio  are  invariably  higher  than  expected 
as  is  seen  in  the  Table  I-A-12  from  the  average  results  of  the  combined 
ratio  column.  This  adds  creditability  to  the  possibility  that  the  points 
were  interpreted  as  small  disks  (i.e.,  as  "f ields-of-vision"  rather  than 
as  points  in  the  center  of  the  disk  or  "lines-of-sight" .)  The  differ¬ 
ence  between  the  two  concepts  becomes  more  pronounced  as  the  scale  size 
of  the  cloudy  regions  decreases  (for  an  interesting  elaboration  of  this 
type  of  phenomona,  see  Mandelbrot  [8J. 
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It  is  on  the  whole  satisfying  that  the  overall  results  are  as  good 
as  Tables  I-A-ll  and  I-A-12  indicate  considering  the  crude  techniques 
employed . 

Study  of  the  point  data  uncovered  what  at  first  appeared  to  be  a 
curious  accident.  Each  of  the  binary  values  V'a(Q)  are  surprisingly  good 
indLicO-to-li  of  the  status  of  the  entire  sky.  The  statement:  if  V  (Q)  is 

0 i 

”0"  implies  that  v(Q)  <  .5  and  if  va(Q)  is  "1"  implies  that  (Q)  >  .5, 
is  correct  over  ninety  percent  of  the  time  for  the  entire  data  base.  The 
statement  is  equally  as  good  for  the  indicators  a  and  p  and  is  nearly 
independent  of  which  point  a  is  chosen.  The  best  results  are  obtained 
from  winter  data  (nearly  952)  and  the  poorest  in  summer  (87%).  Point  A, 


in  the  center  of  the  photograph  produced  consistantly  the  best  result  in 
every  season.  The  poorest  overall  performance  was  given  by  points  at 
zenith  angles  of  80°,  but  40°  was  only  slightly  better.  Points  at  20° 
and  60°  were  nearly  as  good  as  point  A  overall.  The  poorest  performance 
was  at  point  I  in  the  summer.  It  was  correct  only  83%  of  the  time. 


For  any  point  a,  P^  is  defined  to  be  the  probability  that  the 
indicator  value  V^fQ)  correctly  represents  the  indicator  value  v(Q)  in  the 
sense  of  the  previous  paragraph.  The  value  of  P^  is  given  by  the  follow- 


where  the  (x^}  are  taken  to  be  representative  values  of  the  category  rank 
k  and  the  P^  a  are  the  relative  frequencies  associated  with  the  indicator 
v  at  rank  k.  The  representative  values  for  numerical  calculations  were 
taken  to  be  {.025,  .1,  .2,  ....  .9,  .975}  respectively.  (C.F.  Figure 


I-A-5) . 


Using  for  the  a  the  frequency  density  for  the  indicator  v(see 
Table  l-A-10) .  the  values  of  Pa  for  all  data  is  .91  and  for  winter  and 
summer  respectively  are  .96  and  .88  which  are  good  approximations  to  the 
results  which  were  empirically  determined.  A  Simpson  rule  integration  of 
the  theoretical  arc  sine  law  of  chapt.  III.  (C.F.  I-A-5)  estimated 

P^  as  .84  and  the  same  technique  applied  to  a  binomial  distribution  with 
p  =  1/2  estimated  P  for  that  distribution  to  be  .67.  The  values  of  P 

r  0.  Ct 

increases  as  the  extremalness  of  the  distribution  increases.  Its  minimum 
value  is  .5  when  all  density  is  concentrated  at  x  <=  .5  and  its  maximum 
value  is  1.0  when  all  density  is  concentrated  at  the  two  points  x  =  0 
and  x  *  1 . 

This  simple  observation  has  potential  scientific,  military  and 
commercial  applications. 
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four  points  are  on  a  circle  of  20  radios.  The  ranks,  in  eighths,  averaged  over  the  surrounding  subregion(s) 
of  the  point  specified  in  the  top  left  corner,  are  listed  in  the  left  column  of  each  table.  The  total 
number  of  observations  of  that  rank  for  the  specified  point  is  given  in  column  T.  The  number  of  observa¬ 
tions  of  that  subregion  rank  are  broken  down  into  clear  and  cloudy  respectively  in  columns  0  and  1. 


'  RANK. 

COMBINED  RATIO 

POINT  A  RATIO 

RANGE 

,  0 

.0142 

.0053 

0 

- 

.0625 

1 

.2235 

.2105 

.0625 

- 

.  1875 

2 

.2899 

.2727 

.1875 

- 

.3125 

3 

.4508 

.4643 

.3125 

- 

I 

.4375  j 

4 

.6458 

.7000 

.4375 

- 

.5625  1 
1 

5 

.7464 

.7353 

.  5625 

- 

.6875  i 

e 

.8237 

.8958 

.6875 

- 

.8125  1 

7 

.9586 

.9705 

.8125 

- 

.9375  i 

!  8 

.9941 

.9946 

.9375 

-1 

.  000  i 

i 

i 

TABLE  I-A-12 

Comparison  of  Point  Ratios  to  Surrounding  Subregion(s)  Ranked  in  Eighths 


Sixteen  of  the  seventeen  points  are  on  the  boundary  of  two  subregions. 
Point  A  is  the  exception j being  contained  only  in  subregion  1.  The  expression 
"rank  of  a  point"  will  mean  the  average  rank  in  eighths  of  the  subregions 
which  contain,  or  border  the  point,  rounded  down  to  the  nearest  integer. 

is  the  fraction  of  the  set  of  all  observations  whose  rank  at  point 
a  is  k,  which  are  recorded  as  being  cloudy  (i.e.,  V  =1).  The  combined 
ratio  in  the  ahove  table  is  the  average  of  R  ^  over  all  seventeen  points. 

If  the  ranking  of  subregions  and  points  were  done  accurately  and 
according  to  instructions,  it  would  be  expected  that  the  combined  ratios 
should  fall  the  range  given  in  the  right  most  column.  Except  for  rank 
k=8  the  combined  ratios  are  higher  than  the  top  of  the  range.  The  indi¬ 
vidual  values  of  R  ^  (R^  is  given  in  Table  I-A-12)  also  usually  are  above 
the  range.  This  is  suggestive  that  human  observers  always  interpret  "point" 
as  "small  region". 


SUBSECTION  1-B 


THE  RADAR  ECHO  COMPUTER  DATA  BASE 


1.  The  Radar  Echo  Data 


The  data  for  this  study  were  gathered  from  seventeen  different 
weather  stations  around  the  United  States.  These  stations  are  itemized 
in  Table  I-B-l .  The  study  persisted  for  a  three-year  period;  from 
January  1,  1973  until  December  31,  1975.  Information  was  to  be  reported 
regularly  at  three-hour  intervals,  and  the  time  was  to  be  reported  in  LST 
for  each  station.  It  was  understood  from  the  beginning  that  there  would 
have  to  be  allowance  for  some  missing  observations.  This  is  discussed 
later  in  this  subsection. 

The  following  instructions  were  given  to  each  station: 

a)  Every  three  hours  a  photogragh  was  taken  of  the  radar  scope  with  the 
antenna  elevation  angle  at  0  . 

b)  If  the  0°  elevation  ang^e  was  clear,  the  data  £or  the  observation 
consisted  only  of  the  0  photograph.  If  the  0  scope  showed  any 
echoes,  the  antenna  was  raised  to  15  elevation,  and  again  the  scope 
was  photographed. 

c)  If  the  15°  scope  was  clear, Qthe  datg  for  the  observation  gonsisted 
only  of  the  photograph  of  0  and  15  .  If  the  scope  at  15  antenna 
elevation  showed  any  echoes,  the  antenna  was  raised  to  30  elevation 
angle,  and  again  the  scope  was  photographed. 

d)  If  the  30°  scope  showed  any  echoes,  the  antenna  was  raised  to  45°o 
elevation  angle,  and  the  scope  was  again  photographed.  If  the  30 
scope  was  clear,  the  data  consisted  of  the  three  photograph  0  ,  15  and 
30  .  If  the  scope  was  photographed  at  45  ,  the  data  consisted  of 
photographs  from  all  four  elevation  angles. 

Certain  exceptions  to  the  above  rules  had  to  be  permitted  to  account 

for  possible  equipment  failure  and  human  resource  limitations.  If  the 

staff  at  the  station  was  preoccupied  with  other  priority  tasks,  for  instance. 
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monitoring  of  a  storm,  the  observation  could  be  missed  entirely  or 
the  above  sequence  only  partially  completed.  If  the  equipment  mal¬ 
functioned,  of  course,  there  was  no  choice.  The  first  type  of  exception 
has  the  potential-  of  introducing  bias  in  the  sample  because  it  could  over 
represent  fair  weather  conditions. 

Information  from  the  photographs  was  abstracted  and  recorded  on 
coding  sheets  for  each  of  the  stations  in  the  following  fashion.  For 
each  photograph,  an  80  column  card  image  (line  on  the  coding  sheet)  was 
to  be  prepared.  The  first  fifteen  columns  were  used  to  indicate/station/ 
data/time/elevation  angle/of  the  photograph.  Column  seventeen  was  to  be 
left  blank  if  there  were  any  echoes  on  the  photograph.  Otherwise  (if  the 
photograph  had  no  echoes)  column  seventeen  would  contain  a  "9". 

Before  the  beginning  of  the  experiment,  two  mutually  perpendicular 
lines  crossing  at  the  center  of  the  scope  (the  position  of  the  radar 
antenna)  were  selected  for  each  of  the  the  stations.  The  selection  was 
made  on  the  basis  of  minimizing  intersection  with  ground  clutter  at  0° 
elevation  angle.  The  four  rays  generated  by  these  two  lines  will  be 

referred  to  as  the  "azimuth  angles".  Templates  were  made  to  be  placed 

o 

on  each  photograph.  At  0  elevation  each  azimuth  angle  was  marked  off 
into  10  equal  segments  representing  10  nautical  miles  apiece.  For 
each  of  the  higher  elevation  angles  the  templates  were  marked  off  in 
seven  equal  subdivisions  each  one  indicating  an  increase  of  10,000  ft. 
in  altitude  at  that  elevation  angle. 

If  no  echo  patterns  intersected  an  azimuth  angle,  a  "9"  was  place 
in  the  header  column  (see  Table  I-B-2)  for  that  azimuth  angle.  Otherwise 
it  was  to  be  left  blank.  If  a  particular  segment  of  one  of  the  azimuth 


angle  was  not  intersected  by  echo  pattern,  the  corresponding  segment 
column  of  that  azimuth  angle  was  left  blank  (c.f.  TABLE  I-B-2) . 

Column  sixty  six  was  used  to  indicate  that  the  scale  was  different 
from  that  used  above.  This  occurred  only  one  hundred  forty  times  in 
the  data  from  all  seventeen  stations.  It  was  decided  not  to  use  such 
data  but  to  leave  them  in  the  data  base.  Also,  it  should  be  mentioned  that 
the  inner  ten  mile  radius  was  blocked  out  at  elevation  angle  0°  since  the 
intensity  of  the  ground  clutter  image  endangers  the  scope  and  makes  read¬ 
ing  of  echo  data  from  this  region  impossible. 

The  analysis  of  the  radar  photographs  was  made  by  meteorological 
technicians  at  the  National  Climatic  Center  (NCC)  in  Asheville,  N.C.  where 
the  data  were  collated  and  punched  directly  onto  two  seven  track  magnetic 
computer  tapes.  Neither  reel  is  magnetically  labeled  and  both  are  written 
in  buffered  (long-stranger)  record  type.  Each  record  is  one  hundred  card 
images  ordered  by  /station/year/mcnth/day/hour/elevation  angle/  in  ascend¬ 
ing  order.  The  second  tape  consists  of  all  1974  and  1975  card  images  also 
with  the  above  ordering.  The  first  reel  contains  a  total  of  60,901  card 
images,  the  second  131,648.  Thus  the  original  tapes  contained  192,549 
card  images. 

Unexpectedly  it  was  discovered  that  card  images  could  not  be  equated 
with  photographs.  This  is  explained  in  the  next  subsection. 
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2 .  Culling  the  Radar  Echo  Data 


The  contents  of  the  tapes  were  merged  onto  a  single  nine  track 
magnetic  tape  and  listed.  The  listing  is  over  three  thousand  pages  long. 
It  was  used  to  verify  that  all  of  the  stations  reported  during  every  month 
of  the  study  and  that  the  data  were  in  the  advertised  order.  The  listing 
also  suggested  a  list  of  problems  with  card  images  which  would  have  to  be 
searched  for  and  edited  from  the  data. 

The  ordering  of  data  was  further  checked  using  the  CDC  utility  SORT/ 
MERGE  which  operates  on  the  "data  key  principal".  At  the  same  time,  using 
SORT/MERGE,  the  data  set  was  transferred  to  private  disk  pack  STDATA.  The 
new  file  incorporated  all  the  information  into  the  order  scheme  /station/ 
year /month/day/hour/elevation  angle/.  It  was  discovered  at  this  time  that 
there  were  ninety  card  images  from  a  mysterious  station  nineteen.  This 
data  was  contained  on  the  original  tapes  but  was  less  noticeable  when 
divided  into  small  packets  than  when  it  appeared  collected  together  in  one 
place.  Attempts  to  reclassify  these  data  as  belonging  to  other  stations 
failed  and  they  became  the  first  data  compiled  on  the  list  of  card  images 
to  be  sent  back  to  NCC  to  be  checked  against  the  original  coding  sheets. 

A  first  version  of  an  editor  program  was  written  which  scanned  all  80 
card  image  columns  checking  for  impossible  dates,  station  numbers, 
elevation  angles,  improper  (non-numeric)  characters,  and  information 
entered  in  improper  columns.  This  program  found  several  hundred  more 
questionable  card  images  for  the  list  of  problems  to  be  sent  to  Asheville. 
It  was  learned  that  several  stations  would  occasionally  place  the  "9" 
indicating  the  scope  was  all  clear  in  column  nineteen  (the  header  column 
for  azimuth  one)  instead  of  column  seventeen.  If  a  "9"  appeared  in 
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column  nineteen  and  the  rest  of  the  card  image  was  blank,  a  "9"  was  added 
to  column  seventeen  and  column  nineteen  was  made  blank.  If  duplicates 
of  a  /station/time/elevation  angle/  identifier  were  found,  both  cards 
were  removed  and  added  to  the  rectification  list  for  NCC. 

The  list  was  sent  to  NCC  for  clarification.  It  contained  approximate¬ 
ly  one  thousand  card  images.  In  the  meanwhile,  a  new  editor  was  developed. 
It  had  several  new  features  as  well  as  all  of  the  previous  ones.  It  per¬ 
mitted  merging  data  into  the  "buffer  structure  of  the  record"  and  it 
checked  for  a  very  peculiar  problem.  Card  images  had  been  accidentally 
discovered  containing  proper  /station/time/elevation  angle/  identifiers 
and  nothing  else  recorded  on  them.  Checking  with  the  stations,  the  cause 
of  this  problem  was  found. 

Stations  would  occasionally  record  the  identifiers  for  observations 
in  advance  being  certain  that  they  would  be  made.  Sometimes,  such  obser¬ 
vations  were  not  made  or  they  were  made  but  the  photograph  was  not  usable. 
The  card  image  containing  only  the  identifiers  were  not  removed  under  the 
assumption  that  they  would  not  be  punched.  But  everything  on  the  sheets 
was  punched  creating  a  number  of  "empty"  observations.  Surprisingly,  the 
number  of  "empty"  card  images  was  nearly  nine  thousand;  over  eight 
thousand  from  one  station  alone.  These  would  all  have  to  be  detected  and 
deleted  by  the  new  editor. 

Another  task  of  the  new  editor  was  to  enumerate  the  observation  types 
for  each  station.  An  observation  consists  of  one  to  four  card  images 
to  be  a  "proper  observation",  according  to  rules  (a)  through  (d) ,  an 
observation  begins  with  an  elevation  angle  0°  card  and  ends  with  a  card 
with  "9"  in  column  seventeen  or  with  a  card  from  elevation  angle  45°. 
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Observation  were  classified  using  two  hexidecimal  digits.  The  first 


digit  indicates  which  of  the  four  card  images  are  present.  The  second 
indicates  which  of  the  card  images  contains  a  "9"  in  column  seventeen. 

The  first  digit  would  be  "l”  if  only  the  card  image  for  0°  is  present,  a 
"2"  if  only  the  card  image  for  15°  is  present,  a  "4"  if  only  30°  is 
present  and  a  ”8"  if  only  45°  is  present.  For  further  illustration,  if 
0°  and  30°  were  present  but  not  15°  and  45°,  the  first  digit  would  be  a 
"5"  and  if  all  four  observations  are  present,  the  first  digit  would  be 
hexidecimal  "F"  (or  in  decimal  notation  a  "15").  The  second  digit 
characterizes  the  existence  of  clear  scope  indicators.  An  observation 
characterized  as  a  "32"  is  an  observation  with  0°  and  15°  card  images 
and  a  clear  scope  at  15°. 

Certain  observation  types  are  clearly  impossible,  i.e.,  14.  Others 
are  suspicious, i . e . ,  51.  This  is  a  case  with  two  card  images,  one  from 
0°  and  one  from  30°.  The  data  from  0°  indicates  that  the  scope  was  clear. 
The  card  at  30°  indicates  that  there  is  precipitation.  A  situation  which 
is  not  impossible  but  not  very  likely.  It  would  seem  that  what  is  likely 
in  this  case  is  that  the  30°  card  had  the  wrong  data  punched  on  it.  The 
only  observation  types  consistent  with  (a)  to  (d)  are  11,  32,  74,  F8,  or 
F0. 

NCC  eventually  returned  a  report  on  the  list  of  problems  we  had  sent 
them.  They  had  managed  to  unscramble  over  six  hundred  of  them.  Cards 
were  repunched  and  verified  at  the  Regis  Research  Center.  The  new  editor 
had  found  new  problems  some  of  which  Regis  was  able  to  unscramble.  More 
data  were  punched  and  approximated,  and  eleven  hundred  new  cards  were  re¬ 
inserted  into  the  data  base  and  were  subsequently  checked  by  the  editor. 
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3.  The  Final  Form  of  the  Radar  Echo  Data  ^ase 

All  card  images  surviving  the  culling  described  in  §  I-B-2  above 
are  included  in  the  current  data  base.  It  is  written  on  both  disc  pack 
STDATA  (permanent  file  GD5N)  and  on  the  nine  track  magnetic  tape  GD5M. 

In  both  cases  it  is  buffered  out  as  100  card  image  records  and  required 
approximately  25000  PRU‘s  of  space.  The  tape  is  written  at  1600  bits 
per  inch  and  is  magnetically  labeled  GD5M.  It*,  records  are  of  "long 
stranger"  type  on  the  CDC6600. 

For  actual  use  of  the  data  base,  a  preselector  of  data  was  used. 

The  preselector  only  processes  card  images  of  specified  observation  types 
Two  classes  of  observations  were  studied  under  the  current  Regis  Contract 
they  are  designated  as  Class  A  and  Class  B  data.  The  set  of  Class  A 
observations  are  characterized  by  the  first  type  digit  being  a  "1",  "3", 
"7",  or  "F".  Class  B  observations  contain  only  observation  types  (1,1), 
(3,2),  (7,4)  and  (F,8)  and  thus  strictly  adhere  to  the  rules  a),  b) ,  c) 
and  d)  stated  in  subsection  I-B-l  of  this  report.  For  further  clarifica¬ 
tion  of  the  observation  types  and  digits  see  TABLE  I-B-3. 

The  amount  of  Class  A  and  Class  B  data  in  the  data  base  for  each 
of  the  seventeen  stations  is  enumerated  in  TABLE  I-B-4. 


STATION 


LOCATION 


1 

ATLANTIC  CITY,  NJ 

2 

BRUNSWICK,  ME 

3 

CHATHAM,  MA 

4 

DAYTONA  BEACH,  FL 

5 

DETROIT,  MI 

6 

HATTERAS,  NC 

7 

HONDO,  TX 

8 

JACKSON,  MS 

9 

KEYWEST,  FL 

10 

LI MON,  CO 

11 

MEDFORD,  OR 

12 

MIDLAND,  TX 

13 

MINNEAPOLIS,  MN 

14 

MISSOULA,  MT 

15 

SACRAMENTO,  CA 

16 

SAINT  LOUIS,  MO 

17 

SLIDELL.  LA 

TABLE  1-3-1 

The  List  of  Stations  Reporting  Radar  Echo  Data  Used  in  This  Study 


The  data  was  originally  gathered  to  produce  a  radar  based  climatology 
for  a  variety  of  locations  with  dissimilar  climatological  conditions.  The 
data  shows  ground  level  and  several  other  elevation  angles  which  were 
concurrently  measured  by  radar. 
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COL 

1-2 

3 

4-5 

6 

7-8 

9-10 

11-12 

13 

14-15 

16 

17 

18 
19 

20-29 

30 

31 

32-41 

42 

43 

44-53 

54 

55 

56-65 

66 


USE 

Station  number  (2  digits) 

Always  blank 

Year  (2  digits  :  73,  74,  75) 

Always  blank 

Month  (2  digits  :  1,...,  12) 
day  (2  digits  :  1,...,31) 
hour  (2  digits) 

Always  blank 
Elevation  angle 
Always  blank 

Observation  herder  column  (blank  or  9) 
if  "9"  the  scope  is  clear 
Always  blank 

Azimuth  angle  1  header  (blank  or  9) 
if  "9"  there  are  no  echoes  intersecting  line 
Azimuth  angle  1  data 
Always  blank 

Azimuth  angle  2  header  (blank  or  9) 

Azimuth  angle  2  data 

Always  blank 

Azimuth  angle  3  header  (blank  or  9) 

Azimuth  angle  3  data 

Always  blank 

Azimuth  angle  4  header  (blank  or  9) 

Azimuth  angle  4  data 

Special  scale  indicator 

TABLE  I-B-2 


Card  Format  of  Radar  Echo  Data 


Azimuth  angle  data  consists  of  the  character  blank  representing 
zero  or  the  digits  one  to  eight.  For  elevation  angle  0°  all  ten  columns 
are  used.  For  higher  elevation  angles  only,  the  first  seven  columns  are 
use  . 
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HEXIDECIMAL  NOTATION  DECIMAL  NOTATION  45°  30°  15°  0° 


The  Two  Digit  Hexidecimal  Classification  System 

The  first  digit  is  assigned  a  value  from  0  -  IS  according  to  the 
answers  to  the  questions  "does  a  record  of  the  scoge  atQelevation  angle 
N  exist  in  the  data  base?"  for  each  value  of  N  =  0  ,  15  ,  30  and  45  . 

In  the  above  table  "¥"  represents  yi*>  and  "N”  represents  no. 

The  second  digit  classifying  an  observation  is  assigned  its  value 
according  to  the  question  "was  the  scope  recorded  as  being  entirely  clear 
at  elevation  angle  N?". 
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STATION 

NUMBER  OF 
0  0 

0  15 

CARD  IMAGES 

30°  45°  TOTAL 

#  OF  OBSERVATION 

CLASS  A  CLASS  B 

1 

8123 

2052 

899 

865 

11939 

8123 

7427 

2 

8356 

2723 

851 

550 

12480 

8356 

7729 

3 

8330 

714 

353 

121 

9518 

8330 

6104 

4 

8121 

3800 

918 

628 

13467 

8121 

7177 

5 

7184 

342 

156 

72 

7754 

7184 

5475 

6 

7593 

2888 

508 

253 

11242 

7593 

6661 

7 

7683 

893 

142 

73 

8791 

7683 

6833 

8 

7677 

1368 

354 

168 

9567 

7677 

6302 

9 

6659 

3055 

736 

410 

10860 

6659 

4645 

10 

8326 

919 

238 

74 

9557 

8326 

7107 

11 

8173 

1692 

362 

16 

10243 

8173 

8052 

8101 

1313 

151 

81 

9646 

8101 

7632  | 

- 

7600 

673 

66 

31 

8370 

7600 

6852 

1  • 

7740 

1409 

244 

84 

9477 

7740 

7198 

13 

8507 

459 

98 

17 

9081 

8507 

7306 

* 

7611 

2063 

709 

305 

20688 

7611 

6878 

17 

8041 

3399 

1146 

497 

13083 

8041 

. 

7205 

TOTALS 

X 

•o 

29762 

7931 

4245 

175763 

133825 

116583  j 

TABLE  I-B-4 

Itemization  of  Data  Contained  in  Radar  Echo  Base 

The  above  table  itemizes  both  the  number  of  cards  and  the  number 
of  observations  for  the  two  cases  of  observations  described  in  this 
report  (C.F.  §I-B-3).  The  number  of  card  images  for  a  particular  eleva¬ 
tion  angle  is  equivalent  to  the  number  of  reports  of  information  at  that 
station  and  elevation  angle  for  type  A  observation.  The  total  number  of 
cards  is  less  than  the  total  number  for  the  entire  data  base  which  are 
neither  type  A  or  type  B,  e.g.,  an  observation  with  data  at  15°  and  30° 
but  none  at  06.  This  situation  could  arise  if  the  0°  photograph  was 
missing  cr  unusable. 


1 


FIGURE  II-A-I 


Photograph  of  Globe  and  Projection 

The  viewer  sees  somewhat  less  than  180°  of  the  sphere.  The  refer¬ 
ence  point  chosen  is  Cairo  (30°N,  30°E)  and  the  point  located  is  Sri  Lanka 
(7°N,  78°E) .  The  spherical  triangle  referred  to  in  the  text  has  the  north 
pole  and  these  two  locations  as  its  vertices.  The  Figure  II-A-2  illustrates 
each  of  the  coordinate  systems  referred  to  in  the  text. 

The  actual  projection  of  latitude-longitude  lines  from  this  reference 
point  is  given  later  in  Section  II-B  for  several  different  radii  between 
30°  and  180°.  Unlike  the  above  photograph  both  north  and  south  pole  is 
visible  on  some  of  the  projections. 
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SECTION  PREFACE 


The  computer  display  system  described  here  permits  automatic  display 
of  meteorological,  climatological  or  statistical  data  referenced  to  the 
surface  of  the  earth.  The  program  is  interactive.  On  cue,  the  user 
specifies  the  "types"  of  maps  requested,  the  boundaries  of  the  region  and 
details  of  the  background  grid  of  longitude-latitude  lines  to  be  drawn, 
the  maximum  size  of  the  map  desired,  and  the  file  name  on  which  the  data 
is  stored,  (if  it  is  to  be  automatically  presented  to  the  system).  The 
option  of  plotting  geographic,  political  and  topological  data,  as  well  as 
data  contours  of  physical  quantities,  are  to  be  specified  by  the  user. 

The  map  is  then  displayed  on  the  Tektronic  scope  and  may  be  xeroxed 
and,  if  desired,  sent  to  the  facilities  main  CRT  for  hard  copy  photographs. 
A  pen  and  ink  capability  is  also  optional. 

SUBSECTION  I I -A 

THE  EQUATIONS  OF  TRANSFORMATION 

The  mapping  procedure  used  was  originally  proposed  by  the  Swiss 
mathematician,  Johann  Heinrich  Lambert,  in  1759  and  is  known  as  the  Lambert 
azimuthal  equal  area  map  or  sometimes  simply  as  the  equal  area  map.  Its 
advantage  over  other  more  widely  used  projections  is  that  on  meso  or  synop¬ 
tic  scale  it  is  relatively  free  of  areal  or  angular  distortion.  On  a 
global  or  greater  than  hemispheric  scale  it  presents  a  comprehensible 
picture  of  large  scale  patterns. 
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Let  (>.,40*  be  the  representation  of  the  location  of  a  point  on  the 
surface  of  the  eart’-.  The  angle  X  is  its  longitude,  A  *  0  representing 
the  Meridian  at  Greenwich,  J.K.;  A  increasing  in  the  easterly  direction 
(-180°  <  A  <  180°).  The  angle  4  representing  latitude  (-90°  <  4  <  90°), 

4=0  representing  the  equator;  4  increasing  in  the  northerly  direction. 

A  point  (Ao,4o)  will  be  specified  and  called  the  "reference  point”  of  the 
projection.  A  second  spherical  coordinate  system  is  specified  in  terms  of 
the  specified  reference  point.  A  point  in  the  second  coordinate  system  is 
given  by  the  angular  pair  (£,4). 

An  arbitrary  point  (A, 4)  in  the  first  coordinate  is  expressed  in  the 

second  (£,4)  by  equating 4  with  the  shortest  great  circle  distance  between 

('«  ,4  )  and  (A, 4);  thus  0°  <  4  <  180°.  The  angle  £  between  the  meridian 
00  -  - 

passing  through  (*Q»4  )  and  the  great  circle  arc  connecting  (A^.4^)  to 

(A, 4)  by  the  shortest  path  measured  in  the  clockwise  direction.  This  is 

illustrated  in  Figures  II-A-1  and  U-A-2. 

The  point  (A  ,4  )  corresponds  to  the  degenerate  circle  (£.0)  (actually 
o  o 

the  point  4  =  0).  which  is  the  "pole"  of  the  second  coordinate  system. 

This  new  coordinate  system  is  mapped  onto  a  polar  coordinate  system  on  the 
plane  (n,R) .  It  is  in  the  specification  of  this  transformation  where  Lambert' 
contribution  is  seen. 

1This  section  uses  the  coordinates  in  the  order  (A, 4)  rather  than  the  usual 
climatological  order  of  "latitude,  longitude"  because  it  is  more  sugges¬ 
tive  of  the  (x,y)  plot  coordinates  and  because  it  is  commonly  used  in 
papers  in  dynamic  Meterology. 


The  angle  n  is  very  simply  expressed  as: 

1)  n  =  (n  is  measured  clockwise  from  the  y-axis) 

The  radius  R  was  given  by  Lambert  as: 

2)  R  =  (2(l-cos<i>) )  0  <  R, 

The  polar  coordinate  system  (n,R)  is  then  transformed  into  the  Cartesian 
coordinate  system  which  is  fed  into  the  plotting  routines  by  the  following 


elementary  equations: 

3)  X  =  R  sin  n  and 

4)  Y  =  R  cos  n. 

What  remains  is  to  convert  an  arbitrary  (X,<J>)  into  (S.'i')  .  From 
Figure  II-A-  it  is  apparent  that  triangle  abc  is  a  spherical  triangle, 
and  the  law  of  cosines  from  spherical  trigonometry  may  be  applied  to  ob- 
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This  spherical  disc  is  mapped  into  a  planar  disc  about  the  origin  of  radius 
R  =  SR  1-cos  C)  and  of  area  A' : 


10)  A'  =  it(2(1-cos  C))  =  A  . 

c 

Thus,  the  term  "azimuthal  equal  area"  is  appropriate.  The  distortion  to  an 
arbitrary  infinitesimal  area  element  d  i|)  d  £  on  the  sphere  when  it  is  mapped 
to  the  x-y  plane  is  given  by  the  following  argument.  Let  B  be  any  variable 
(for  which  and  ^  are  defined  at  each  point  (ip,£)  on  the  sphere 


then: 

,  dR  =  j)R  dj;  3R  d_v|_ 

dB  ‘  3£.  dB  3^  4B  * 

Substituting  ii  for  B  and  using  equation  5)  this  becomes 

ii  \  dR  3R  sin  tj/  .  ,,.*5 

lla)  d*  *  3^  *  ~T~  "  sin  v(2(1-cos  *>>  > 

and  thus  the  areal  distortion  of  the  mapping  is  shown  in  the  following 
equation: 

12)  d  R  d  n  =  d  *d  5  • 

K 


Distortion  is  seen  to  depend  only  on  the  distance  of  the  region  from  the 
reference  point.  Table  Il-A-1  compares  areal  distortion  with  the  value  of 
the  angular  distance  iji.  A  map  of  a  region  of  15°  radius  or  less  has  areal 
distortion  of  less  than  1%  and  this  distortion  is  mainly  on  the  outer  peri¬ 
meter  of  the  map. 


The  map  is  not  conformal  nor  does  it  even  preserve  right  angles.  The 
analysis  of  angular  distortion  has  been  carried  out  and  is  very  much  more 
complicated  to  summarize  than  areal  distortion.  Its  effects  can  be  seen 
on  the  maps  included  in  Figures  II-B-1  and  II-B-2 . 
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V(in  degrees) 
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Contraction 

1.0000 

0.9990 

0.9962 

0.9914 

0.9848 

0.9659 

0.9396 

0.9063 

0.8660 

0.8192 

0.7660 

0.7071 

0.6427 

0.5736 

0.5000 

0.4226 

0.3420 

0.2588 

0.1736 

0.0872 

0.0000 


TABLE  1I-A-1 

Contraction  of  Infinitesmal  Area  Elements 


The  values  listed  are  values  of  sini|>/R.  The  projected  element  is 
always  smaller  than  element  d d  6  but  this  is  barely  noticeable  at 
synoptic  scale.  On  a  circle  of  radius  15°  on  the  globe  it  is  less  than  1%. 

Contraction  refers  to  the  ratio  of  the  infinitesmal  area  element 
d  R  d  n  on  the  x-y  plane  to  the  infinitesmal  area  element  d  rp  d  £  at  point 
(<p,£)  on  the  sphere. 


SUBSECTION  II-p 


TWO  TYPES  OF  REGIONS 


The  current  computer  program  permits  two  types  of  regions  to  be  mapped. 
The  regions  will  be  referred  to  as  1)  "rectangular"  and  2)  "cameo”.  The 
boundary  for  a  rectangular  map  is  specified  as  rectangle  in  longitude- 
latitude  coordinates.  The  map  is  always  plotted  with  its  geometric 
center  in  the  center  of  the  map  plot  but  its  reference  point  need  not  be 
the  geometric  center. 

The  boundary  for  the  cameo  method  is  a  circle  with  center  (X  ,$  ),  the 

o  o 

reference  point.  The  region  is  the  locus  of  all  points  ( X ,  )  whose  great 

circle  distance  from  (X  ,  <t  )  is  less  than  or  equal  to  an  input  radius  0  . 

oc  r  o 

Cimeo  maps  at  present  must  always  have  (  X  ,  <>  )  as  their  geometric  center. 

The  application  of  the  two  types  of  regions  are,  1)  the  rectanqulur 
region  is  convenient,  esthetic  and  conventional  for  the  meso  and  synoptic 
scales  and  permits  labeling  of  longitude  and  latitude  lines  alonq  the 
boundary;  2)  the  cameo  was  intended  for  large  regions  of  hemispheric  or 
greater  scale  and  it  does  show  large  scale  phenomena  and  its  relation  to  a 
small  included  area  to  advantage  (it  is  not  without  some  appeal  at  small 
scale  sizes) . 

Figures  II-B-1  and  Figure  II-B-2  depict  several  examples  of  the  Cameo 
type  regions.  They  use  Cairo  (30OE,  30°N)  as  the  reference  point.  Ref¬ 
erence  (9)  presents  several  examples  of  the  rectangular  type  region. 

o 

The  map  in  Figure  II-B-1  is  made  with  i|i  =  120  .  This  map  corresponds 
to  the  photograph.  Figure  II-A-1.  The  photograph  shows  a  plane  intersecting 
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a  globe  at  the  distance  of  120°  from  the  point  (3C°E,  30°N) .  The  map  is 
a  representation  of  everything  on  the  sphere  above  the  plane  in  the  photo¬ 
graph.  Notice  that  both  the  north  and  south  poles  are  contained  in  the  map 
but  not  the  photograph. 
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This  displays,  in  Lambert  equal  area  projection,  the  entire  region 
of  the  globe  above  the  plane  intersection  shown  in  the  photograph  Figure 
II-A-1.  Notice  that  both  the  north  and  south  polar  regions  appear  in  the 
above  graphic.  The  map  is  centered  at  Cairo  30°  E,  30°  N  and  uses  ij>  =  120 
The  latitude-longitude  grid  lines  are  spaced  at  10°.  The  polar  caps  have  5 


radius. 


FIGURE  II-B-2 


Several  Other  Cameo  Type  Regions 

All  four  of  the  above  grids  are  drawn  with  the  reference  point 
(30®E,  30°N)  as  the  geometric  center.  The  spacing  of  latitude/longitude 
lines  is  indicated  by  A.  The  grid  i3  aligned  with  the  point  (0°,  0°). 
Notice  that  map  a)  shows  both  the  north  and  south  polar  cap.  The  southern¬ 
most  extent  of  d)  is  on  the  equator. 


SUBSECTION  II-C 


DRAWING  ARCS  OF  LATITUDE  AND  LONGITUDE  CIRCLES:  THE  INTERSECTION  PROBLEM 

After  the  user  defines,  interactively,  the  region  (type,  locations 
and  size)  the  system  offers  the  option  of  specifying  which  longitude  and 
latitude  lines  should  be  plotted  on  the  map.  The  boundary  of  the  region 
is  always  drawn  on  the  map.  It  is  possible  to  specify  that  the  grid  of 
latitude-longitude  not  be  plotted.  An  example  of  the  interactive  ques¬ 
tion  and  answer  process,  used  in  the  case  of  a  cameo,  is  given  in  Figure 
II-E-1.  This  is  a  xerox  copy  from  the  Tektronic  scope  screen. 

When  a  particular  curve  of  constant  longitude  or  latitude  is  chosen 
by  the  user,  the  program  must  find  the  points  of  intersections  of  the 
specified  curve  with  the  boundary  of  the  region  and  draw  only  the  portion 
of  the  curve  which  is  interior  to  the  boundary.  This  is  a  relatively 
simple  task  with  the  rectangular  regions  and  a  surprisingly  intricate 
problem  for  the  cameo  type  regions.  The  details  of  the  method  of  solu¬ 
tion  of  the  constant  longitude  curve  problems  for  cameo  regions  is  given 
here.  It  is  a  paradigm  for  other  intersection  problems  which  arise  in 
the  drawing  of  geopolitical  boundary  and  of  contour  data.  The  constant 
latitude  curve  problem  is  similar  to  the  constant  longitude  curve  prob¬ 
lem  but  somewhat  simplier. 

Equation  (5)  can  be  used  together  with  the  Pythagorean  theorem  to 
obtain  an  equation  for  the  sines  and  cosines  of$j  the  values  of  latitude 
at  which  the  intersection  of  the  boundary  and  a  given  curve  of  constant 
longitude  occur.  A  curve  of  constant  longitude  on  the  sphere  is  the  arc 


of  a  great  circle  passing  from  the  south  pole  to  the  north  pole.  Let  ijt 

be  the  "radius"  of  the  cameo  with  reference  point  (X  ).  Substitution 

o  o 

into  equation  (5)  gives: 


13a)  cosO  *  sin4>_sin$  +  cos<+>,cos£  cosy 
o  I  o  I  o 

where  y  =  X-X  .  Applying  the  Pythagorean  theorem  (3a)  becomes: 

2  2  2  2 

13b)  (cosi(i  -(cos<j>  cosy)cos$)  ■  sin  $  -sin  $  cos  9 
o  o  oo 

which  is  a  quatratic  equation  in  the  variable  cos4>.  Using  the  classical 
quadratic  formula  two  roots  c+  and  c  may  be  found  which  are  possible 
values  of  cos  4^: 


/  2  ~  2  2  2 
cos^cosi?  cosy  +  sin^^/sin  <f>o  +  cos  4>oCos  y  -  cos 


13c)  C, 


2  2  2 
sin  4>  +  cos  9  cos  y 

o  o 


and  tentative  values  of  the  sines  can  be  computed  as: 

13d)  S,  *  (cos4<  -C.Cos*  Cosy)/sin$  . 

+  o+o  o 

‘If  the  quantity  under  the  radical  in  (13c)  is  not  positive,  there  are 

no  intersection  points.  In  this  case  the  complete  curve  is  either  interior 

or  exterior  to  the  region.  It  is  interior  if  and  only  if  one  point  of  the 

curve  is  interior.  Two  points  are  particularly  easy  to  check  and  have 

special  properties  that  make  them  of  interest  in  themselves,  they  are  the 

north  pole  and  the  south  pole.  The  north  pole  is  interior  using  (5)  if 

and  only  if  cosik,  _  >  ccsi|<  .  But  cosik,  _  ■  cos(tt/2-4>  )  “  sin$  .  Like- 

N.P.  o  N.P.  o  o 

wise  the  south  pole  is  interior  if  and  only  if  costy  p  >  cosipQ  and 
cosv_  p  *  -sin4>  from  which  the  relations: 

1A)  3inp  >  coat  and 
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15)  -sin<f  >  cosO 
o  o 

become  the  criteria  for  the  inclusion  of  the  north  pole  and  the  south  pole 

respectively  in  the  interior  of  the  region.  If  (14)  is  not  satisfied,  and 

it  has  been  determined  that  there  are  no  intersections,  then  nothing  is 
plotted.  If  (14)  is  satisfied,  the  north  pole  is  connected  to  the  south 
pole  with  a  curve  of  constant  longitude. 

If  the  radical  in  (13c)  is  positive,  then  C+  and  C  are  proper  inter¬ 
sections  if  and  only  if  0  <  C  <1.  If  the  value  is  greater  than  unity, 
it  cannot  be  a  cosine.  If  its  value  is  less  than  zero,  it  could  not  be 

in  the  first  or  fourth  quadrant  and  thus  the  latitude  would  not  be  between 

-90°  and  +90°  i.e.,  the  intersection  is  in  another  hemisphere.  If  neither 
C+  nor  C  are  proper  intersections,  the  problem  is  treated  as  though  the 
radical  was  negative. 

If  C+  and  C  are  both  proper  intersections,  then  if  (14)  is  satisfied, 
(15)  must  also  be  satisfied.  In  this  case,  the  north  pole  is  connected  to 
C+  and  C  is  connected  to  the  south  pole.  If  neither  (14)  nor  (15)  is 
satisfied,  then  C+  is  connected  to  C_.  If  either  C+  is  an  intersection 
and  C  is  not  or  vice  versa,  and  if  both  (14)  and  (15)  are  satisfied,  then 
the  north  pole  is  connected  to  the  south  pole  (if  this  is  the  case,  the 
curve  must  be  tangent  to  the  boundary  at  the  proper  intersection  point). 

If  only  one  of  (14)  or  (15)  are  satisfied,  then  the  existing  proper  inter¬ 
section  is  connected  to  the  interior  pole.  If  both  poles  are  exterior,  an 
error  has  been  made. 

Latitude  lines  are  not  actually  plotted  all  the  way  to  the  poles  as 
too  many  lines  converging  on  a  point  cause  "blotting"  in  the  case  of  pen 


I 


and  in*  drawing  and  "blooming"  in  caso  of  CRT  piora.  Tbs  «*  is 

to  supply  a  snail  radius  about  tbe  polos.  Tbe  system  plots  the  promotion 

,,  .  T-o-rm-i nates  longitude  lines  on  their  boundaries 

of  these  small  caps  and  terminates  longicua 

ratbor  than  at  tbe  polos  tbonsolues.  Tbe  sane  »ou!d  bo  ttu.  of  geopolitical 
maps  and  cootouts.  five  dogroos  seems  to  «oti  ».U  on  global  ado  naps 
and  less  may  be  used  for  smaller  maps. 
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SUBSECTION  II-D 


CONTOURS  AND  GEOPOLITICAL  BOUNDARIES 

The  term  contour,  in  this  section,  refers  to  a  collection  of  curves^- 

on  the  surface  of  the  sphere.  Each  of  these  curves  will  be  referred  to 

as  a  segment  of  the  contour  and  a  segment  containing  only  one  point  will 

be  called  a  degenerate.  The  system  treats  all  contours  uniformly,  whether 

they  represent  geopolitical  boundaries,  topographical  data,  or  curves  of 

the  constant  values  of  a  physical  quantity.  The  line  used  to  represent 

different  contours  may  vary,  i.e.,  the  line  width  on  CRT  plots  and  the 

line  types  on  pen  and  ink  drawing  or,  whether  and  how  lines  are  labeled, 

may  vary  according  to  the  users  specification  but  the  basic  format  in  which 

the  data  is  presented  is  uniform.  This  section  is  concerned  with  the  form 

2 

of  the  basic  data  structure  employed  by  the  system  for  contours. 

The  contour  of  a  continental  outline  map  with  a  rather  skimpy  resolu¬ 
tion  of  0.1°  in  longitude/latitude  would  require  more  than  50,000  points 
for  the  entire  globe.  Since  each  point  is  represented  by  two  coordinates, 
in  long  form,  this  would  take  more  than  100,000  words.  This  is  an  exces¬ 
sive  amount  of  core  for  even  the  largest  computer  system  and  would  be  im¬ 
possible  at  the  AFGL  computer  center.  The  other  extreme  is  to  read  the 
map  into  core  one  point  at  a  time.  This  would  require  approximately  one 


^Curve  here  means  a  geometric  configuration  homeomorphic  with  either 
a  bounded  closed  interval  of  the  real  line  or  with  a  circle. 

2 

A  "data  structure"  in  computer  science  is  a  data  base  containing 
implicit/explicit  instructions  on  the  location  of  the  next  piece  of 
information  and  how  it  is  to  be  processed. 
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minuted  of  I/O  time  for  the  simple  example  above.  Considering  the  delays 
using  intercom  nearly  ten  minutes  would  pass  before  the  map  were  drawn 
after  the  user  requested  it:  painfully  and  impractically  slow. 

All  practical  mapping  systems  must  follow  a  middle  course  between 
these  two  extremes.  Unfortunately,  the  optimal  course  is  machine  depend¬ 
ent.  The  method  described  here  uses  two  CDC  6600  special  features,  not 
common  to  other  manufacturers'  computing  system.  For  this  reason,  care  must 
be  taken  that  the  programming  of  contour  input  and  processing  be  modular 
in  nature  and  easily  changed  should  AFGL  change  systems  or  if  the  program 
is  to  be  used  at  other  facilities. 

The  tvo  special  CDC  features  are  the  60-bit  word  length  and  the 
BUFFER  TN/OUT  capability.  The  length,  in  bits,  of  each  computer  word  de¬ 
termines  the  amount  of  information  (longitude-latitude  coordinator)  and 
logic  that  car.  be  packed  into  one  word.  The  BUFFER  IN  feature  allows  the 
system  to  input  one  block  of  data  simultaneously  with  the  processing  of  a 
previous  block.  Since  input  time  overlaps  with  the  computation  time,  from 
the  users  standpoint,  input  time  is  of  no  practical  importance  using  the 
BUFFER  feature.3 

«  tentative  format  for  the  data  structure  for  contouring  in  this 
system  has  been  devised.  Small  auxilliarv  programs  are  needed  to  convert 


3 

At  most  modern  computing  facilities,  AFGL  included,  I/O  time  is 
recorded  separately  from  CPA  time  and  is  charged  at  a  different  and 
aigher  rate.  Even  so,  the  BUFFER  feature  is  worthwhile  because  in 
an  interactive  environment  it  causes  fewer  and  shorter  delay  time 
than  conventional  1/6  methods. 
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existing  computer  map  tables  and  the  output  of  contour  routines  to 
accommodate  this  format  and  input  record  type. 

Each  segment  begins  with  several  "header"  words.  They  include  one 
word  which  contains  the  maximum  and  minimum  values  of  X  and  .  If  the 
segment  does  not  intersect  the  region  being  mapped,  the  program  skips 
immediately  to  the  next  segment.  To  accommodate  this  the  header  must  also 
contain  the  relative  location  of  the  next  segment  in  core.^  For  convenience 
it  also  contains  the  length  in  number  of  points  of  the  current  segment. 

A  segment  may  cross  the  boundary  of  a  specified  map  region  any  number 
of  times  which  creates  intersection  problems  that  have  some  relationship 
to  the  intersection  problem  of  the  last  section  but  are  logically  more 
complicated  and  their  solutions  will  not  be  presented  here. 

Block  or  record  length  will  normally  be  512  words  long.  Thus,  each 
block  could  contain  up  to  1024  points  (four  "numbers"  are  contained  in 
each  word  and  two  "numbers"  are  required  to  describe  each  point  on  the 
globe)  .  If  a  segment  is  not  completed  in  a  block,  it  continues  at  the 
first  word  of  the  following  block.  Each  latitude  is  recorded  as  a  14  bit 
byte  and  each  longitude  as  15  bits.  This  permits  a  resolution  of  greater 
than  one  minute  (actually  to  1/64  of  a  degree)  in  both  latitude  and  longitude. 

The  values  are  stored  as  binary  integers.  In  latitude,  for  instance 

14 

the  south  pole  is  represented  by  "0"  and  north  pole  as  2  -1  (in  binary 

4 

If  the  current  segment  extends  beyond  the  end  of  the  current  record 
being  processed,  the  system  detects  this  and  waits  until  the  next 
record  is  in  core  before  processing  can  continue. 


11111111111111) .  The  integer  is  stripped  from  the  word  and  converted  to 
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a  floating  point  number  which  is  then  multiplied  by  2n(2  -1).  The  re¬ 

sultant  floating  point  number  is  latitude  (4>)  expressed  in  radians 
(-it/2  <  <  ir/2) .  For  longitude,  a  similar  scheme  is  employed  utilizing 

15  bits  instead  of  14. 


1  GO/ERASE 

2  GO/SAUE 

3  EXIT 

ENTER  OPTION! 

TYPE  OF  GRID: 

1- RECTANGLE 
2 .CAMEO 
RESPONSE:  2 
REFERENE  POINT 

DEC.  DEG  (LON/LAT):  30  30 

12.  RADIUS  :  90 

13.  BOUNDARY 

RESOLUTION:  100 
14. INTERIOR 

RESOLUTION:  1ES 

15.  SIZE:  4.3 

16. X  ORIGIN; 3 

17.  Y  ORIGIN:  1 
LONGITUDE  LINES: 

18.  FIRST-180 

19.  LAST180 

20. INCREAMENT10 
LATITUDE  LINES: 

21.  FIRST-90 

22 .  LAST90 

23. INCREAMENT10 
24. POLAR  CAP 
(DEC.  DEG.  ):  S 
CHANGES?:  0 


FIGURE  II-E-1 


Example  of  Interactive  Query  System 

Computer-user  dialogue  xeroxed  from  the  Kektronic  scope.  The  first 
option  is  supplied  by  the  system  software.  The  user's  response  "1"  after 
the  word  "option"  indicates  that  all  previous  display  is  to  be  removed  from 
the  scope.  Each  of  the  subsequent  questions  1  through  24  are  asked  by  the 
Regis  software.  The  answers  supplied  by  the  user  in  the  above  example  de¬ 
fine  the  graphic  II-B-2  b)  for  ^  =  90°  (for  more  discussion  see  text  II-E) 

This  dialogue  appears  in  the  right  hand  margin  of  the  scope.  The 
above  figure  is  a  150%  enlargement  of  the  actual  size  produced  on  the  xerox 
copy  of  the  screen. 


SUBSECTION  II -E 


THE  CUEING  SYSTEM:  INTERACTIVE  USE  OF  THE  SOFTWARE 

The  primary  method  of  creating  a  map  using  the  system  is  through  the 
textronic  graphics  terminal.  The  process  is  interactive,  i.e.,  after  cer¬ 
tain  preliminary  instructions  are  supplied  by  the  user,  the  program  sys¬ 
tematically  requests  information  about  the  map  and  its  construction  from 
the  user.  Figure  II-E-1  shows  a  xerox  copy  of  the  Tektronic  scope  during 
the  initial  input  stage. 

The  program, in  the  of  Figure  II-E-1,  first  requests  that  the  user 
indicate  the  type  of  boundary  desired  (this  request  for  information  is 
multiple  choice).  The  user  responds  "2"  -  requesting  a  cameo  boundary. 

The  program  then  requests  that  the  user  supply  the  coordinates  of  the 
reference  point;  the  user  responds  "30  30"  (or  30°  east,  30°  north,  the 
approximate  coordinates  of  Cairo).  This  process  continues  until  the  pro¬ 
gram  has  enough  information  to  draw  the  grid. 

At  this  point,  the  program  queries  "CHANGES?".  If  the  user,  after 
examining  the  scope  screen,  decides  that  all  answers  he  has  given  are  as 
desired,  he  responds  "0"  and  the  program  clears  the  screen  and  draws  the 
grid  -  in  the  case  of  the  example  it  is  the  grid  in  Figure  II-B-2(b),  If 
the  user  desires  to  change  any  of  his  previous  responses,  he  responds  with 
the  number  of  the  question.  The  program  re-asks  the  question  and  the  user 
supplies  the  correct  answer.  The  program  then  responds  again  with  "CHANGES?”. 
The  user  has  the  option,  at  this  time,  to  xerox  the  face  of  the  Tektronic 
scope  by  pressing  a  button  on  the  keyboard. 
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After  the  grid  is  drawn,  the  program  enters  a  new  phase.  In  the 
left  hand  margin  it  queries  the  user  about  contour  overlays.  After  each 
contour  has  been  specified,  it  is  drawn  on  the  map  (either  directly  or  as 
a  separate  overlay,  as  the  user  desires).  The  geopolitical  boundaries 
themselves  are  a  contour  (C.F.  Subsection  II-D) .  As  many  contours  may  be 
drawn  or  overlayed  to  a  map  as  is  desired.  When  the  user  has  finished 
with  all  of  the  contours,  he  indicates  this  to  the  program  which  then 
i  ..ters  a  third  phase.  Xeroxing  may  be  done  at  any  point  that  the  user 
desires . 

The  third  phase  of  the  program  allows  the  programmer  to  label  the 
map  and  create  a  ledger.  When  this  phase  is  completed,  the  user  is  given 
a  choice  of  the  disposition  of  the  map,  its  overlays  and  labeling.  The 
choices  are  simply  to  xerox  them,  send  them  to  the  computer  center's  CRT 
where  slides  will  be  produced,  or  to  obtain  pen  and  ink  drawings  of  them. 
When  sufficient  information  has  been  supplied  to  the  program,  this  is  done 
automatically  and  the  program  returns  to  the  starting  point  again,  ready 
for  a  new  map. 

It  is  also  possible  to  run  the  program  using  cards,  read  in  from  the 
computer  center  or  any  of  the  high  speed  terminals.  However,  it  is  some¬ 
what  awkward  to  prepare  the  input  for  this  type  of  operation. 
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SUBSECTION  II-F 


THE  STATE  OF  COMPLETION 

Not  all  features,  discussed  in  this  section,  exist  as  completed 
software  at  the  date  of  termination  of  this  contract.  A  main  framework 
of  the  program  exists  and  is  reliably  running.  Other  parts  are  completed 
as  "breadboard"  construction  and  some  are  still  in  the  experimental  stage. 
Nearly  everything  mentioned  in  the  preceding  Subsections  has  been  tried 
and  found  to  work  sufficiently  well  to  be  merged  with  the  framework.  The 
one  major  exception  to  this  is  labeling  and  ledgers.  Although  some  exper¬ 
imentation  has  been  done,  the  actual  options  to  be  included  have  never 
been  finally  decided  upon. 

The  contouring  features  have  been  designed  but  the  programming  is 
still  in  the  embryonic  stage.  This,  of  course,  includes  the  drawing  of 
geopolitical  boundaries  and  topographical  contours.  A  computer  map  of 
the  globe,  including  political  boundaries  of  North  America,  has  been  ob¬ 
tained  and  a  portion  of  it  has  actually  been  plotted  experimentally  but 
the  small  program  to  convert  it  to  the  format,  specified  in  Table  II-D-1, 
has  not  been  written.  The  map  itself  is  of  coarse  resolution,  particularly 
for  mesoscale  work.  Work  has  been  in  progress  improving  the  interactive 
cueing  system  and  improving  the  grid  system. 

The  plots,  included  in  this  report,  required  less  than  two  seconds 
apiece  to  produce.  It's  expected  that  the  computer  maps  or  other  contours 
will  take  approximately  the  same  amount  of  time  or  less. 
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Certain  ideas,  not  discussed  previously  in  this  report,  have  been 
investigated.  They  include  the  making  of  motion  pictures  displaying  the 
development  of  global  or  large  scale  meteorological  phenomenon  and  the 
drawing  of  three  dimensional  surfaces  using  the  Lambert  azimuthal  equal 
area  projectioi 
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